Measuring Human-perceived Similarity in Heterogeneous Collections

نویسندگان

  • Jesse Anderton
  • Pavel Metrikov
  • Virgil Pavlu
  • Javed A. Aslam
چکیده

We present a technique for estimating the similarity between objects such as movies or foods whose proper representation depends on human perception. Our technique combines a modest number of human similarity assessments to infer a pairwise similarity function between the objects. This similarity function captures some human notion of similarity which may be difficult or impossible to automatically extract, such as which movie from a collection would be a better substitute when the desired one is unavailable. In contrast to prior techniques, our method does not assume that all similarity questions on the collection can be answered or that all users perceive similarity in the same way. When combined with a user model, we find how each assessor’s tastes vary, affecting their perception of similarity.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic keyword extraction using Latent Dirichlet Allocation topic modeling: Similarity with golden standard and users' evaluation

Purpose: This study investigates the automatic keyword extraction from the table of contents of Persian e-books in the field of science using LDA topic modeling, evaluating their similarity with golden standard, and users' viewpoints of the model keywords. Methodology: This is a mixed text-mining research in which LDA topic modeling is used to extract keywords from the table of contents of sci...

متن کامل

Measuring Concept Similarity of Heterogeneous Ontologies in Multi-angent System

Different kinds of agents in a multi-agent system have different knowledge structure, which results in difficulties of interaction and coordination among agents. At present, ontology based knowledge representation is an effective way of resolving such difficulties, the key of which is heterogeneous ontology concept similarity measuring. In this paper, we designed an integrated heterogeneous ont...

متن کامل

Highly Heterogeneous XML Collections: How to Retrieve Precise Results?

Highly heterogeneous XML collections are thematic collections exploiting different structures: the parent-child or ancestor-descendant relationships are not preserved and vocabulary discrepancies in the element names can occur. In this setting current approaches return answers with low precision. By means of similarity measures and semantic inverted indices we present an approach for improving ...

متن کامل

A framework for comparing heterogeneous objects: on the similarity measurements for fuzzy, numerical and categorical attributes

Real-world data collections are often heterogeneous (represented by a set of mixed attributes data types: numerical, categorical and fuzzy); since most available similarity measures can only be applied to one type of data, it becomes essential to construct an appropriate similarity measure for comparing such complex data. In this paper, a framework of new and unified similarity measures is prop...

متن کامل

Soft Computing A Framework for Comparing Heterogeneous Objects: on the Similarity Measurements for Fuzzy, Numerical and Categorical Attributes

Real-world data collections are often heterogeneous (represented by a set of mixed attributes data types: numerical, categorical and fuzzy); since most available similarity measures can only be applied to one type of data, it becomes essential to construct an appropriate similarity measure for comparing such complex. In this paper, a framework of new and unified similarity measures is proposed ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1802.05929  شماره 

صفحات  -

تاریخ انتشار 2018